I hereby certify that this correspondence is being deposited on the date indicated below as first class mail with the
U.S. Postal Service addressed to the ASSISTANT COMMISSIONER FOR PATENTS, WASHINGTON, DC 20231.
Center (STIC) detected errors when processing the following computer readable
form: 



Application Serial Number: 0 

Source: 

Date Processed by STIC: 




THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212.

FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216.
PATENTIN 2.1 e-mail help: patin21help@uspto.gov or phone 703-306-4119 (R. Wax)
PATENTIN 3.0 e-mail help: patin3help@uspto.gov or phone 703-306-4119 (R. Wax)

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER
VERSION 3.0 PROGRAM, ACCESSIBLE THROUGH THE U.S. PATENT AND
TRADEMARK OFFICE WEBSITE. SEE BELOW:



Checker Version 3.0

The Checker Version 3.0 application is a state-of-the-art Windows-based software program
employing a logical and intuitive user interface to check whether a sequence listing is in
compliance with format and content rules. Checker Version 3.0 works for sequence listings




O'gSAI-T'Sl.S „ G 



Raw Sequence Listing Error Summary



ERROR DETECTED SUGGESTED CORRECTION 



SERIAL NUMBER 



ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE



1 Wrapped Nucleics

The number/text at the end of each line "wrapped" down to the next line.
This may occur if your file was retrieved in a word processor after creating it.
Please adjust your right margin to .3, as this will prevent "wrapping".

Wrapped Aminos

The amino acid number/text at the end of each line "wrapped" down to the next line.
This may occur if your file was retrieved in a word processor after creating it.
Please adjust your right margin to .3, as this will prevent "wrapping".



Incorrect Line Length The rules require that a line not exceed 72 characters in length. This includes spaces. 
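
The 72-character limit stated above can be checked before a file is submitted. The following is a minimal sketch only, not the Checker program itself; the function name `find_long_lines` and the sample text are hypothetical and were chosen for illustration.

```python
def find_long_lines(text, limit=72):
    """Return (line_number, length) for every line longer than `limit`.

    Spaces count toward the length, as the rule above states.
    """
    problems = []
    for number, line in enumerate(text.splitlines(), start=1):
        if len(line) > limit:
            problems.append((number, len(line)))
    return problems


# Hypothetical sample: the first line is exactly 72 characters, the second is 76.
sample = "acgt" * 18 + "\n" + "acgt" * 19
print(find_long_lines(sample))
```

Only the second sample line is reported, because a line of exactly 72 characters still complies.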



Misaligned Amino Acid Numbering

The numbering under each 5th amino acid is misaligned. This may be caused by the use of tabs
between the numbering. It is recommended to delete any tabs and use spacing between the numbers.



Non-ASCII 



This file was not saved in ASCII (DOS) text, as required by the Sequence Rules. 

Please ensure your subsequent submission is saved in ASCII text so that it can be processed. 
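
One way to locate the offending characters before resubmitting is to scan the file's raw bytes for anything outside 7-bit ASCII. This is a sketch under that assumption; the helper name `non_ascii_positions` and the sample bytes are hypothetical.

```python
def non_ascii_positions(data: bytes):
    """Return (line_number, column, byte_value) for each byte above 0x7F."""
    hits = []
    for line_no, line in enumerate(data.split(b"\n"), start=1):
        for col, byte in enumerate(line, start=1):
            if byte > 0x7F:
                hits.append((line_no, col, byte))
    return hits


# Hypothetical sample with one stray non-ASCII byte at the end of line 2.
sample = b"<212> DNA\n<213> Escherichia col\xef\n"
for line_no, col, byte in non_ascii_positions(sample):
    print(f"line {line_no}, column {col}: byte 0x{byte:02x}")
```

In practice the bytes would come from reading the listing in binary mode, for example `open(path, "rb").read()`.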



Variable Length

Sequence(s) contain n's or Xaa's which represented more than one residue.
As per the rules, each n or Xaa can only represent a single residue.
Please present the maximum number of each residue having variable length and
indicate in the (ix) feature section that some may be missing.

PatentIn ver. 2.0 "bug"

A "bug" in PatentIn version 2.0 has caused the <220>-<223> section to be missing from amino acid
sequence(s). Normally, PatentIn would automatically generate this section from the
previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section
to the subsequent amino acid sequence. This applies primarily to the mandatory <220>-<223>
sections for Artificial or Unknown sequences.



Skipped Sequences
(OLD RULES)

Sequence(s) missing. If intentional, please use the following format for each skipped sequence:

(2) INFORMATION FOR SEQ ID NO:X:
(i) SEQUENCE CHARACTERISTICS: (Do not insert any headings under "SEQUENCE CHARACTERISTICS")
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:X:
This sequence is intentionally skipped

Please also adjust the "(iii) NUMBER OF SEQUENCES:" response to include the skipped sequence(s).



Skipped Sequences
(NEW RULES)

Sequence(s) missing. If intentional, please use the following format for each skipped sequence.

<210> sequence id number
<400> sequence id number
000

Use of n's or Xaa's
(NEW RULES)

Use of n's and/or Xaa's has been detected in the Sequence Listing.
Use of <220> to <223> is MANDATORY if n's or Xaa's are present.
In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents.
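
The n/Xaa requirement above lends itself to a simple pre-submission check. The sketch below is not the PTO's software: it assumes a new-rules listing in which each record starts with a bare `<210>` line and the residues follow the `<400>` line, and the function name `records_missing_feature` and the sample listing are hypothetical.

```python
import re


def records_missing_feature(listing: str):
    """Return SEQ ID numbers that use n or Xaa but lack a <220>/<223> section."""
    flagged = []
    # Split just before every <210> so that each chunk is one sequence record.
    for chunk in re.split(r"(?m)^(?=<210>)", listing):
        header = re.match(r"<210>\s*(\d+)", chunk)
        if not header:
            continue
        head, _, body = chunk.partition("<400>")
        # Skip the remainder of the "<400> N" line; keep only the residue lines.
        residues = body.split("\n", 1)[1] if "\n" in body else ""
        seq_type = re.search(r"<212>\s*(\w+)", head)
        if seq_type and seq_type.group(1).upper() == "PRT":
            has_unknown = "Xaa" in residues
        else:
            # Nucleotide residues are lowercase; drop spacing and position numbers.
            has_unknown = "n" in re.sub(r"[^a-z]", "", residues)
        has_feature = "<220>" in head and "<223>" in head
        if has_unknown and not has_feature:
            flagged.append(int(header.group(1)))
    return flagged


# Hypothetical two-record listing: record 1 uses n without a feature section.
listing = """<210> 1
<211> 12
<212> DNA
<213> Escherichia coli
<400> 1
acgtnnacgt ac 12
<210> 2
<211> 10
<212> DNA
<213> Escherichia coli
<220>
<223> n at position 5 is a, c, g, or t
<400> 2
acgtnacgta 10
"""
print(records_missing_feature(listing))
```

A skipped sequence written as `000` contains no residues, so it is never flagged by this check.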



11 Use of "Artificial"
(NEW RULES)

Use of "Artificial" only as "<213> Organism" response is incomplete, per 1.823(b) of New Sequence Rules.
Valid response is Artificial Sequence.



12 Use of <220> Feature
(NEW RULES)

Sequence(s) _ are missing the <220> Feature and associated headings.
Use of <220> to <223> is MANDATORY if <213> ORGANISM is "Artificial Sequence" or "Unknown".
Please explain source of genetic material in <220> to <223> section.
(See "Federal Register," 6/01/98, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of new Rules)
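
The two <213> conditions described above (a bare "Artificial" response, and a missing <220> to <223> section for "Artificial Sequence" or "Unknown") can be sketched as a check on a single record. This is an illustration only; the function name `check_organism`, the message wording, and the sample record are hypothetical.

```python
import re


def check_organism(record: str):
    """Return a list of problems with the <213> response in one sequence record."""
    match = re.search(r"<213>\s*(.*)", record)
    if not match:
        return ["missing <213>"]
    problems = []
    organism = match.group(1).strip().lower()
    if organism == "artificial":
        problems.append('use "Artificial Sequence", not "Artificial"')
    if organism in ("artificial", "artificial sequence", "unknown"):
        # These responses require an explanation of the source of the material.
        if "<220>" not in record or "<223>" not in record:
            problems.append("<220> to <223> section required")
    return problems


# Hypothetical record that triggers both problems.
record = "<210> 5\n<211> 4\n<212> DNA\n<213> Artificial\n<400> 5\nacgt 4\n"
for problem in check_organism(record):
    print(problem)
```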



13 PatentIn ver. 2.0 "bug"

Please do not use "Copy to Disk" function of PatentIn version 2.0. This causes a corrupted
file, resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence listing).
Instead, please use "File Manager" or any other means to copy file to floppy disk.



AMC - Biotechnology Systems Branch - 4/06/2001






OIPE 
RAW SEQUENCE LISTING

PATENT APPLICATION: US/09/847,513



DATE: 05/16/2001 
TIME: 13:37:02 



Input Set : A:\MBA10101.txt

Output Set: N:\CRF3\05162001\I847513.raw 



<110> APPLICANT: MBARI 
DeLong, Edward 
Beja, Oded 

<120> TITLE OF INVENTION: Light-driven energy generation using proteorhodopsin 
<130> FILE REFERENCE: MBA-101

<140> CURRENT APPLICATION NUMBER: US/09/847,513 

<141> CURRENT FILING DATE: 2001-05-01 

<150> PRIOR APPLICATION NUMBER: 60/201,602 

<151> PRIOR FILING DATE: 2000-05-03 

<160> NUMBER OF SEQ ID NOS: 65

<170> SOFTWARE: PatentIn version 3.0

<210> SEQ ID NO: 1 

<211> LENGTH: 105184 

<212> TYPE: DNA 

<213> ORGANISM: Naturally occurring gamma proteobacterium 
<220> FEATURE:
<221> NAME/KEY: CDS (complement)
<222> LOCATION: (50866)..(51615)
<223> OTHER INFORMATION: light-driven proton pump; has the properties of a
      light-driven proton pump when expressed with retinal in Escherichia coli
<300> PUBLICATION INFORMATION:

Does Not Comply
Corrected Diskette Needed

<301> AUTHORS: Beja,O., Aravind,L., Koonin,E.V., Suzuki,M.T., Hadd,A., Nguyen,L.P.,
      Jovanovich,S.B., Gates,C.M., Feldman,R.A., DeLong,E.F.
<302> TITLE: Bacterial rhodopsin: evidence for a new type of phototrophy in the sea
<303> JOURNAL: Science
<304> VOLUME: 289
<305> ISSUE: 5486
<306> PAGES: 1902-1906
<307> DATE: 2000-09-15
<308> DATABASE ACCESSION NO: AF279106
<309> DATABASE ENTRY DATE: 2000-06-15
<313> RELEVANT RESIDUES 
<400> SEQUENCE: 1 



acacccattt 



. (51615) 








taacttaata ctaatatata attagtttat 60
ttttacacta aatgcatttt cttgctcaga 120
ttccgctgag tcaagaaacc tttgcgagtt 180
tgcaagtaga tgtggttaca cttatcaata 240
taaagatgaa gattttctag taattgggat 300
tgatgaaagc gatgttgcag aattttgttc 360
ctcaactgct aaagtcaaag gaaaaaaagc 420
atcaggtttt actccctcat gcaactttaa 480
tgtatccaca tatggatcaa aggtaaagcc 540
aggcttgctg taaaattatt acttagaaac 600
tattccatta tctacaactc caggaatatt 660
tgaaatatcc atattagaga tatctaaaat 720
tctatatgtg ggtattccac cgatcgagat 780
aggtatcacc tctataggca gtggaaaagc 840






72 tcccaaaaga ttaaccatct. ttgactgatc aactatacat ataaactcgt 
74 agcaactatc ttttctctag tatgtgcgcc accaccacct ttaataagac 
76 agacacctca tctgcaccat ctatgtaata agctatatca actacatcat 
78 gacctctatc ccattttcat ttaataattt tgatgaagca tctgaactag 
80 agcaaatttg tgcctatgct cctttagttc ttctataaaa aaattaactg 
82 tccaatacct aaaatcatct caggatgaag attatttttg atatattcta 
84 agcaacattt atctttgagc cactcataga gttataatac aagaaaatat 
86 ttattttgag actaaaaatt aaaaaaacag gttcttttaa gaattcccag 
88 agcttattcc taatgcatct gtttatgacg tagcattaaa atcacctata 
90 taaatatttc ttcaaagctg gggaataaag ttttcctaaa aagagaggat 
92 tattttcttt taaaaacaga ggagcgtata acaagattgt aaatttatcc 
94 agaagagggg ggttattgct gcatcagcag gaaatcatgc tcaaggggta 
96 gtaagaaatt aaaaattaat tgcttgatag ttatgccaat aacaactcca 
98 taaaagatgi: aaaaagattt ggagccaaaa tactccaaca tggggacaac 
100 cattaaaaga ggcactgttt attgcaaaga aaaaaaaatt gtcttttgtt 
102 acgaccctct aacaattgct ggccaaggga ctataggaca agaaattctt 
104 ataattttga tgttgtcttt gttccggtgg gaggaggagg tattctagct 
106 cctggatagc acagaataat aagaaaataa aaattgttg.g tgttgaggtt 
108 cttgtcttgc tgaggccgta aaagctaata aaagagttat tttaaaagaa
110 ttgctgatgg ggtggcagta tcaagggttg gaaaac.at'ia ttttgatgtt 
112 gcgtagatga agtcattaca gttagcgttg atgaggtctg caccgctgta 
114 ttgaagatac aagggttcta tcagaacctg ctggggcatt agcacttgca 
116 cctacgcaag gaaagttaaa aataaaaaac ttattqctat aagttctggc 
118 atttccaaag acttaatttt attgttgagc gatcag?gat tggtgaaaat 
120 tattaagtat caaaatccca gagatacctg gaagttttct taagctttca 
122 gcagctctca agttacagag tttaactaca ggaaatctag cttaagcgat 
124 tagttggtgt tagaactaaa actgaaaaat catttgaaat cttaaagtcc 
126 aagcaggctt cacctttaac gactttactc gaaatgaaat atccaatgat 
128 atatggttgg tggcagaaat agtgactcag gctctcataa caatgaaaga 
130 gagagtttcc tgagaagccg ggcgcgctgt taaattttct agagaaattt 
132 ggmatatttc cttatttcat tacaggaacc taggttcagc ttttggaaag 
134 gcatcgagag taaggatiiaa gacaagctaa taaatcattt aaataagtca 
136 ttacagaaga aacctctaac aaggcataca aagatttttt aaaatgaaag 
138 taatctaaat ttaattgaaa aaagctcatc gctagggttt tcccacggct
140 ctcggattga gatctatcat cctcctcgtc gtaaattctc ccacctttag 
142 aaatagatat gacaaaggag - cgafjctcata tttatatcta atttggaacg 
144 agtattaaat tcattaacta tattatttcc tttataaagg tatccattag 
146 aatactaata ggattttttg ctttcaaagc aacaaattga ctcttaagtc 
148 tttattattt ttaaaccae.t ttagatcaaa agataaggta tcttgtcttg 
150 ggcaagatta ttattatcct gccatatcag ccattcattt tcctttctta 
152 cgcattgatt cttaagttat catttggaaa tattgaacct gctatcttgt 
154 accataccca ttcgaatccc accgattatc tttctctccc ttaaagaagc 
156 gtcatatgtc cagaatgaat agttctttgc ctcaaagtct gctgtaatac 
158 ttttgatttg ataaaggggt aggcttcatt ttttcttgtg atagttgtat 
160 agatctaaag ttaaaatcta attgaaattt agagttgtcc ttaaaactaa 
162 ttgatcgatg cctattggat ttgaattacc agctgtgtca gcatcataat
164 tccataatct atttgtttta atatcgagct attatcaaat tcatttattt 
166 tccaatacca gcatgaatcc agtctcttct ttgcagataa ccaaagtcat 
168 gtcgtcttca aaataaagaa ggcttccact tatgttagat agtttatttg 



tagaggcaga 
aattttcagg 
taaggctaaa 
aaacagctcc 
ttgagccggt 
tagcttgttt 
aggtagttaa 
aagtacctaa 
acatttgctc 
ctgcaaccta 
gatgccgaaa 
gccagtgcat 
gaaataaaaa 
gtagatgcag 
catccttttg 
gaagataaaa 
ggtgtatctg 
gaggattccg 
gtgggcctct 
attaaagagt 
aaagatatct 
gggttaaaag 
gctaatgtaa 
agagaaaaaa 
aggatgtttg 
gcatatgttt 
aaattaaaaa 
catctgaggc 
atatttaggg 
ggaaataaat 
at attaattq 
gTOactattt 
?ttae 



gtTaatactt 
ctttgaacaa 
aatagaccaa 
aagctacgcc 
catctgaaaa 
tgatctcatg 
aatcatatga 
ttctatattg 
aaaattttct 
taactctcca 
ctattcttct 
ttttcccaga 
aagaattttt 
ttagatcaat 
ttcgatttcc 
ttaactcgaa 
gaagatatgt 



900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 



2760 

2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 



170 aaactgagtc ctatacccaa gcccattttt accatctttt tctgaggcta acagatctga 
172 atatgtaatt aatttttttg aacgaatatt gatgtaatca ataacattga ctgttgatga 
174 ttcgcctgtc atttcattct caacattcgt taccatgaag ccaagcgttt tatttccaag 
176 ctttgttcga gatcgaagag cataatagtc tcttccaact gaaaaggctt catcagcctc 
178 acttgctaca aatactccaa attcattatt attacttttt tgagtaagtc ttaatgcaaa 
180 atcaatatca gaatagtttt ttttagctgc ctcgcaacct tcttcattac tttcttctga 
182 gcaattatag ctgggggcag ctccaatcct ccgcgtattt ataaccgagt atctatcata 
184 attactaata tcaaatagtg attggttttc attgaaaaat gctctttttt ctgagtaaaa 
186 agtttcttga gcagaaaagt taataaccac atcatcactc tcagcttgtc cgaaatctgg 
188 attaatagct aaatttattt gacgaccttt tccagtgcta taaaagattt cagccccaat 
190 atctgaccct tcttggtttg taactgaatt tttatttgaa gatatatatg gaaaaaaagt 
192 aagctttgat tttgtatagt tttgtatttc taagctatct aactcttgaa agtagtcatt 

194 tctactagcc attgttccgg cactgctaac ccatgactca ttttcggcac tataacgtaa 

196 tgcggtgtaa ttaatttttc ttatatcacc atcaggctgt ttcattaacg ttacatccca
198 aggaataaaa aactcagaga cccaataccc atcaaatttt tgtgtttttg caatccaatc 
200 tccatcccag tctgtcttaa agtctcctgc ttgcgttttt atggcatcga aaagcgagtt 
202 cccaagattt atagcaagaa tgaaagcttt gttaccatca ccaLcaaagt ctatatttat 
204 agagtttttd tcgctaagtg aatrtatttg atctctiagc gtcttcctgg agaacataga 
206 atcattactt tgaaaattct taaatccaac atatatacca tccttatttg agaaaattaa 
208 agccgttgta agaagttcat ttttcttaag agtaaa.agga gatgtttcat aaaaatctgt 
210 aatttcaaat gcattattcc actcaggctc atcaagagag ccatcaataa caattgagtt 
212 tgaccaaatc agcacagagg taagtaaaag tgataatgag gctaaagtta ttttcataga 
214 tagatttaat tcagagtatt ttaatctatg aagagtagga aatcatccga tcattctcag 
216 aaaaaacata atcatattta aggtcatgtt tttctccaaa atgcatatta caaatctggt 
218 attgataaca cagtccaacc aataaaggtc ttgatacctc ctcgtttata gagccaataa 
220 ttctatcgaa atatccagag ccatagccaa gcctgtatcc atttaaatca actcctgtca 
222 taggaataaa cattaaatca atttcattta tgttgacata atcttcactt ttaacctctt 
224 tgatcccaaa ttgatttata aagaagttag gctgctcatc cagcaaatta aaagccatca 
226 tttcgtcatc aattactttt ggaatataaa tattcttttt aa.gtttagta aaagcttgaa 
228 ttaacaaatt tgtattgact tcatttcgaa aaggaaaata taaggcaatg ttttgcattt 
230 catgagtatt gattttttct aatacatttt cttggattaa aaaacttara ttgctcttag 
232 ataaatcaga aatagactgc ccctgctcaa agagtgattt tcgtatttta tttttcacca 
234 taaattgtgc cgaggctaac aaaccaaatc accgccatgt gctatacctg aaccgatggg 
236 tcaggtggtg aatttcataa caaatcaggc ttccctgtaa aggtactgcg caacaatgct 
238 agaggaatat cacctattta attgtatcgg ttcaaattta ttaac?catt agcgtatgaa 
240 ccagaatgca gttaattata aaatatatag gtattaagta aaagttaatt tttagagagc
242 agactctatt ttttgtatta gcttttcgat atctttatta tttaaagcag agtcagtatt
244 ctcggttggc ttggataaaa gcttactagc catagttaga ccagctatca ccaaggcatt
246 attcttatca ctgatgccat caagctcatc atttaataat tttgcagctc taatgagttg
248 gtctttctct tctggcggac aggctaacgt cagatctcgg ccaaaaattc ttaaagatag
250 ogtttccatt tctgacatta ttttttgagt ttagttattt ctttatttag tatgagctta 
252 tcctttttcc agtctctttc cctttgcaca taagaatcta ttattccttt ttgctcatca 
254 aatcttttta tcaaaagatc taccttatcc tctagttcaa ataatgatga ttttttttct 
256 tctgacattg tataagttta attgtactag aatgaattgg aaagtttctt taataaaaga 
258 gtaaaataag gcatggaaaa aataattttt aaaaacagaa gagattcact tataaaacac 
260 ctacctaaga attcagcctt aattgtacct ggtgcagatt tgcaatatag aaatgctgac 
262 tcatcttata atttaagaca ggaaagtagc ttctactatc tgtctggctt ttgtgagccc 
264 tcttctctaa tggttttagt taataatgga aaaagcattg attcaataat ttttgttcct 
266 gaaaaagata aacttaaaga aatctgggat ggttatcgag ctggccctga gggcgcaata 



3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6060
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 



268 aatgattttc tttttgatca agcttttgaa aataataaat cagatgcttt aatgcctgaa 
270 atccttcaag ggctagaaaa agttttttat tcaataggga agaaaaatgg ctttgatcag 
272 aaagtaattg actggacatg cgcagcaaat tctaaagata ggcacagcaa atcaattgat 
274 attattgatg gctcttcgat ggtaggaaat ttaaggctta tcaaagataa gcatgaaatt 
276 gatattatga agagagcttg tgaaatttca gctgaatcat atattgaggt catgaaatct 
278 ataaagcctg gggacaatga gcaggaaata gaggcgctat ttttatatga attcgccaaa 
280 aggggaggaa ggtttccagc ttatacacct atagttgctg gaggtgaggg tgcttgtgta 
282 ttgcattata ttgaaaatga taaagagtta gcttcatcag atttaatttt ggtagacgca 
284 ggatgtgaat acaaaatgta tgcatctgat atcacaagaa ccttcccagt aagtggaaaa
286 ttttcagatg aacagctaca aatttataat attgtccaca aagccaatct tgctgcaatc 
288 gatgctgtaa aaactggtaa tagcataatg gagccccaaa tggtttcaga aaaagtaatt 
290 actgaaggtc ttgtagagtt gggtattcta tctggcgatg ttaatcagct tcataaaaat
292 ggtgcattca aggactttta tatgcataag gtgggacatt ggcttggact tgatgttcat 
294 gatgttggtg actacatgga gggagatgag tttatgaagt ttaagccagg gatgataacc 
296 acaatcgagc caggcatcta tatcagtagc gcaatggatg tagatgacaa atggaaaggc 
298 atcggcataa gaatagagga cgacatcctt gtaacagatt caggcaatat taatctaaca 
300 gagaaggtgc catctaatcc tcaagaaata gaatcattga tggcttagac tatggaggtt 
302 ccaattgtta tttctggcgg agggataata ggtaattaca tttctcttag gcttgaaaaa 
304 aataatatca aaaccgttat tgtcgaaae.a gctag-cagtt tcaaagccct agataagggt 
306 ataagaacag tcactctcaa tgagcattct atgcaeatgc taaaaaatat tggtatttgc 
308 ccatcaattg ctcaaatcaa cagcatcgac gtattagatg gtgagggtac aggcaaaatt 
310 caatttctag caaaggacgt aggcagcgaa aacctttcat atgtaaccta tttcaatgaa 
312 ttacaaaaac taatttctga tccatgtaaa gaaagaacct tatttaataa tgagattgat 
314 tcagttcaga atcttaatac agaatctgc.t ccagagatca tgcttaaaga tggcatgacc 
316 ataaaaacga atctaattgc tggatgtgat ggaagaaatt caaatattgc aaaaattgct 
318 tcacttacaa gcagcttcga tgactactta caaacagctt taacttttgt cgttgatatt 
320 gataatgatt cacatggcaa agctcaccaa gttttttctg aaaaaggaat atttgcactt 

322 atgccactcc cagaaggcaa gggtgagatg aataaatgca cagtggtttg gtcaataaaa 
324 aatcaagttt tgggagatga gcctgtatct gagtttgtaa aaaataacat ttcttttttt 
326 gaatcaacigc ttaatgttag tctcagggtt aagtcagaaa ttttaagttt taaattatcg 

328 aaccatcatt ttgaaaacta tattagcgga cctattgttc ttcttggtga tgctgctcac
330 tcaattcacc ccttagcagg tcaaggtatt aatctaggat ttgcagatgc agatactttt 
332 tgtgaagagg taattagttc ttataaaaaa gggattgcct ttaatgagaa atcagtttta 
334 aaaagatatg agattagaag aaaaagtatg aactttttaa tgttgaagtc tatggacttt 
336 tttgtggatt tatttggttc agaaaattta tatctaaggc tgatcagaaa tttaggtatt 
338 tcttcgttaa ataaatcacia gtttgttaaa gcattcttta taagacatgc ctctggaatg, 
340 aataagtttt aaattcgtat taaacttttt gacctttagc tctaagttct ttaagaactt 
342 cactaatgcc ttttttatca atgattctca taccttttgc agatacttta agatttacga 
344 acctgttctc agattcaacc caaaatttgt gtgtgtgaag attaggaaaa aactttcttt 
346 tagtcctatt tttagcgtga gaaacattgt ttcctgactg tggtatctta cctgttactt
348 gacatatttt actcattgaa acgcgatttt atagaacact gaggaactta gcaatactat 
350 tgtgaaacaa atttatttat tacggcatgc acaatctgac tgggagagct ctaatcagaa 
352 agattttgat agaccattag caagaaaagg cattgaagaa gcaaataaaa tatcatgtta 
354 ctgcaaatct cattcaattt tagtagataa aatattctgt agcactgcag aaagaactaa 
356 gcagactttt gatatatgca gtgatgggct taattatcca atagctgaag cagtctatac 
358 tgatgagctt tacttttctg gccctggtga gatcgttaag cttatccaaa gtttaagtga 
360 attcatttcc tctgttttaa taataggcca caatccatca atgcaaatgt acatagatgc 
362 tatttcagaa aatcctcata ttacgtattc aacatgcggg ctggcagaaa ttctcgttga 
364 aagttcatgg aaagacttat ctttaaaaaa atgtaagtta aaatctttta ttcaaccagg 
366 agagctttaa aaattgaaaa acataaaaat taagatcatt aacccactaa tgggatccaa
368 gataccctta cctcaatatg aaacaaaggg ctcggcagga ttggatttaa gggcatgcct 
370 agatagtaat ctcagccttc aagcaggaac atctcagttg atacctattg gttttgcaat 
372 gtacttagaa gatcccggtc ttgcagcaat ggttatacct agatcaggtt taggttctaa 
374 gcatggaatc gttcttggta atctggttgg gttgattgat tcagactatc aaggagagct 
376 aatggttcct gcctggaata gatcagatac agattttgag attaatcctg gagacaggat 
378 tgcacaaatg attatagttc cagtgattca agcagatttt gaaattgtag acgagttcaa 
380 tgagactcag aggggagaaa agggttttgg aagttcaggt ataaattgat aaatttactt 
382 tttcttgcca aatctttctt caaatttctg aactcttccg ccagtatcaa taattttttg 
384 cttaccggta taaaaaggat gagaagcaga ggatatatca agagggtagt atgggtatgt 
386 ttttccatct tcccattctt tcgtttgagt cgtatctaat gttgaacgaa tgagaaagaa 
388 cttatcagca ctagcgtcat ggaataaaac ttcacggtat tcaggatgta tatctttttt 
390 caudataac.a tcaaaatttg gatgagaact ataacaaaaa acaacttctt ttcaatcaaa 
392 acatgaaaat tttttactat gacatagctg tttcattgcc tctaaggcaa tgctttactt
394 atagttctga gcttaaaatt acaaaaggaa cccgcgtgtc agttcctttt ggtaaaagga 
396 agattgtagg agtggttata aaaaatattc aaaagccaga tttcctaaaa aaagcgggag 
398 ctataaaaaa aattatcgct gtacttgatg aatatccttt gttcgacaag ccaatttttg 
400 attctatatt gtggtcttct gattactatc atcatcctat tggtgaggtt tttaatacct 
402 ttatacccac cgaattaaga aaaatta?.ta ataaaaaaat tgaagcttta agagaatttt 
404 ctgaatattc agtaaatgag gatgataaga aattc-gattt a.accaaggat caagaaaaag
406 cagtcaaggc gctttctaaa tctaaaggat tttcacccac tttattatat ggagttacag 
408 ggtctggaaa aacagaagtt tacttaagag ttgccgaaac ttttattaaa aataataagt 
410 cagtattagt tttagttcca gaaataaatt taacccccca attgctctcg cgatttgaga 
412 atagatttaa tggtgaaatt ggcatatatc attctaagca aacagcagct aagagattaa 
414 agacttggct aaaagctaaa tttggttcta taaaaataat agtaggaact cgatcttcgg 
416 ctttagtgcc tttagataac attggtttaa taattatcga tgaagagcat gaccaatcat 
418 ttaggcagtc agaagggttt aaattctctg ctagagactt aagtataaaa agggcacagc 
420 ttgcagatat tccaattatt ttgggatcag caaccccttc gctgcaaact ttaaaacttg 
422 taaaagaaaa taaatttata agagttgata ttcctaatcg agttgatgga aacaagcctc 
424 ctaaattaat agccttagat atcaataaca gccctttaat aggcggagtt gctaaagaga 
426 caattgaagc aatgcaatja accatagaca gaggagaaca ggttctagtt tttattaata 
428 gc-cgagga^:t cgctccactc tatcaatgta gtagttgtgg gtgggtagca gattgtaaat
430 cttgtgatac aaatttagtc ttccaccagg caagaaatag attaatttgt cataggtgtg
432 aatctgccta ctctgttaat ttgtcttgtc cggcatgcaa gtctaatgac tttaatatgt
434 atgr;agctgg aacagagaga gttgaagaag ttcttaaaag cagntttgta aagactccaa 
436 taattagagt tgatcatgac tcaacaaaaa aagtgggagc tatggaggct atagttaaaa 
438 aaattcattc ctcagacgca gcaattttag ttggaactca aatgcttgca aaaggacatg 
440 attttcctaa agtcacctta agcgttattt taaatgctga taatggcctt ataagcccag 
442 aaattaatgc attagagaaa atatctcaat tgcttattca ggtctctgga agagcaggaa 
444 gaaataataa tcttgcaaaa gttattattc aaacaagata tcctgatgat. ataaatctta 
446 atav.aattaci gacaggagat tatatgaaat ttgcttctca atgcctaagt accaatgagc 
448 aaatgaactt acctccattt actactttat gtctgcttag gtqctcatca ccaactcaaa 
450 agagtaatgt agatttccta gagaaagctg ttttaatttt atccaatagg actgatataa 
452 atgttattgg tcccttgcct tcattagttt cgaagtcgaa aggaaattat aggcaccaag
454 tctatatcca tgcaccaaag aagacttttt taaataaggt attaaagttt ttgacaacag
456 agtttgaaaa atggccggaa tctaataagg ttaagtggtc tttcgacatt gatccaatag 
458 acttaagcta aatattaatc ttaattaatt gtcctgggta tattggttta ttgtttagtt 
460 tattctctgt attaatttct tctacagtca ccccaaatct tatcgctatt tctgataaga
462 catccccttt ttgtattttg taagtcacaa agcctggatc aatactcata aaggtatttg
<210> 4
<211> 750
<212> DNA
<213> Naturally occurring gamma proteobacterium
<220>
<221> CDS
<222> (1)..(750)
<223> light-driven proton pump; has the properties of a light-driven
      proton pump when expressed with retinal in Escherichia coli. Note
      that additional three nucleotide residues incorporated by pcr
      priming with reference to the original 31A08 DNA sequence (DNA
      residues 4-6, ggt), adding a new restriction site for cloning
11 M:270 C: Current Application Number differs, Replaced Current Application No
11 M:271 C: Current Filing Date differs, Replaced Current Filing Date
24 M:257 W: Feature value mis-spelled or invalid, <221> Name/Key for SEQ ID#:1
134 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:1
3559 M:257 W: Feature value mis-spelled or invalid, <221> Name/Key for SEQ ID#:2
3572 M:286 W: Invalid Database Entry: Valid Formats YYYY-MM-DD or Mon-YYYY, SEQ ID:2
3584 M:257 W: Feature value mis-spelled or invalid, <221> Name/Key for SEQ ID#:3
3597 M:286 W: Invalid Database Entry: Valid Formats YYYY-MM-DD or Mon-YYYY, SEQ ID:3
3615 M:259 W: Allowed number of lines exceeded, <223> Other Information:
3790 M:286 W: Invalid Database Entry: Valid Formats YYYY-MM-DD or Mon-YYYY, SEQ ID:6
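
Each entry in the list above appears to follow the pattern "line number, M:message code, severity letter, message text". The sketch below parses entries on that assumption so they can be grouped by code. The reading of the layout, and of "C" and "W" as a severity flag, is inferred from the entries themselves rather than from any PTO documentation, and the names `ENTRY` and `parse_summary` are hypothetical.

```python
import re
from collections import Counter

# Assumed layout: <line number> M:<code> <C or W>: <message text>
ENTRY = re.compile(r"^(\d+)\s+M:(\d+)\s+([CW]):\s*(.*)$")


def parse_summary(text):
    """Return (line_number, code, flag, message) tuples for lines that match ENTRY."""
    entries = []
    for line in text.splitlines():
        match = ENTRY.match(line.strip())
        if match:
            line_no, code, flag, message = match.groups()
            entries.append((int(line_no), int(code), flag, message))
    return entries


# Three entries copied from the summary above.
summary = """11 M:270 C: Current Application Number differs, Replaced Current Application No
24 M:257 W: Feature value mis-spelled or invalid, <221> Name/Key for SEQ ID#:1
3559 M:257 W: Feature value mis-spelled or invalid, <221> Name/Key for SEQ ID#:2
"""
entries = parse_summary(summary)
counts = Counter(code for _, code, _, _ in entries)
print(counts.most_common())
```

Grouping by code shows at a glance which kind of problem recurs across sequences, here code 257 for the <221> Name/Key value.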